A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization

نویسندگان

  • Dixian Zhu
  • Changjie Cai
  • Tianbao Yang
  • Xun Zhou
چکیده

In this paper, we tackle air quality forecasting by using machine learning approaches to 1 predict the hourly concentration of air pollutants (e.g., Ozone, PM2.5 and Sulfur Dioxide). Machine 2 learning, as one of the most popular techniques, is able to efficiently train a model on big data by using 3 large-scale optimization algorithms. Although there exists some works applying machine learning 4 to air quality prediction, most of the prior studies are restricted to small scale data and simply train 5 standard regression models (linear or non-linear) to predict the hourly air pollution concentration. 6 In this work, we propose refined models to predict the hourly air pollution concentration based 7 on meteorological data of previous days by formulating the prediction of 24 hours as a multi-task 8 learning problem. It enables us to select a good model with different regularization techniques. We 9 propose a useful regularization by enforcing the prediction models of consecutive hours to be close 10 to each other, and compare with several typical regularizations for multi-task learning including 11 standard Frobenius norm regularization, nuclear norm regularization, `2,1 norm regularization. Our 12 experiments show the proposed formulations and regularization achieve better performance than 13 existing standard regression models and existing regularizations. 14

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine learning algorithms in air quality modeling

Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...

متن کامل

Stock Price Prediction using Machine Learning and Swarm Intelligence

Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of the stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this...

متن کامل

Application of ensemble learning techniques to model the atmospheric concentration of SO2

In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...

متن کامل

Prediction of Breast Tumor Malignancy Using Neural Network and Whale Optimization Algorithms (WOA)

Introduction: Breast cancer is the most prevalent cause of cancer mortality among women. Early diagnosis of breast cancer gives patients greater survival time. The present study aims to provide an algorithm for more accurate prediction and more effective decision-making in the treatment of patients with breast cancer. Methods: The present study was applied, descriptive-analytical, based on the ...

متن کامل

PREDICTION OF SLOPE STABILITY STATE FOR CIRCULAR FAILURE: A HYBRID SUPPORT VECTOR MACHINE WITH HARMONY SEARCH ALGORITHM

The slope stability analysis is routinely performed by engineers to estimate the stability of river training works, road embankments, embankment dams, excavations and retaining walls. This paper presents a new approach to build a model for the prediction of slope stability state. The support vector machine (SVM) is a new machine learning method based on statistical learning theory, which can so...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017